New Similarity Coefficients for Binary Data

نویسندگان

  • Viviana Consonni
  • Roberto Todeschini
چکیده

In the last few decades, the use of similarity measures has been becoming more and more important due to the relevance of comparing samples in order to find out clusters of similar samples, to generate priority lists, and, in general, to discover patterns in data structures. In drug design, their relevance is already well established to search for the most suitable alternative to a target drug. In the QSAR field they are currently the key factor in read-accross strategy along with the defined chemical space. Similarity indices for binary variables are usually called similarity coefficients and their first definitions date back to the end of the 19th century provided by scientists especially interested in taxonomic studies. Till date, more than 50 different similarity coefficients have been found in the literature, each having its own mathematical properties and characteristics and used in different scientific fields. In this paper, five new similarity coefficients for binary data are proposed and compared with some well-known similarity coefficients. MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 68 (2012) 581-592

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Surface Tension Model for Prediction of Interaction Energy between Components and Activity Coefficients in Binary Systems

In this work, we develop a correlative model based on the surface tension data in order to calculate thermodynamic parameters, such as interaction energy between components (Uij), activity coefficients and etc. In the new approach, by using Li et al. (LWW) model, a three-parameter surface tension equation is derived for liquid mixtures. The surface tension data of 54 aqueous and 73 non-aqueous ...

متن کامل

Privacy-preserving similarity coefficients for binary data

Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements use...

متن کامل

A Comparison of Multi-way Similarity Coefficients for Binary Sequences

The paper compares three formulations of n-way (for groups of size n ≥ 2) similarity coefficients for binary sequences. Properties that the similarity coefficients may have in general, not just for specific data, are discussed, and it is investigated how the different n-way formulations are related. Using the n-way Bennani-Heiser coefficients, the similarity between m sequences (2 ≤ m ≤ n) is a...

متن کامل

Machine Cell Formation Based on a New Similarity Coefficient

One of the designs of cellular manufacturing systems (CMS) requires that a machine population be partitioned into machine cells. Numerous methods are available for clustering machines into machine cells. One method involves using a similarity coefficient. Similarity coefficients between machines are not absolute, and they still need more attention from researchers. Although there are a number o...

متن کامل

Signal detection Using Rational Function Curve Fitting

In this manuscript, we proposed a new scheme in communication signal detection which is respect to the curve shape of received signal and based on the extraction of curve fitting (CF) features. This feature extraction technique is proposed for signal data classification in receiver. The proposed scheme is based on curve fitting and approximation of rational fraction coefficients. For each symbo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012